Poster: jory2 Date: Jul 25, 2012 10:43am
Forum: web Subject: Re: Domainsponsor.com erasing prior archived copies of 135,000+ domains

@Jeremy Leader - I wasn't referring to any Disallow; I asked CoJaBo where he/she found the delete command.
Oddly enough, I have never come across one for this website.

RE: robots.txt: as I'm sure you know, respecting the robots.txt file is voluntary. No informed site owner would rely solely on such a file if their goal was to protect their Intellectual Property from being copied without permission or consent.

For what it's worth, the domain name _ _ _.com doesn't own the copyrights to the copyright-protected materials that were used on that given domain name; the rightful owner of the works remains the owner.
Should the rightful owner wish not to renew or reactivate his/her website, that certainly doesn't mean the copyrights expired along with the domain name.
When the domain name is "parked", the copyrights and all Intellectual Property rights remain the property of the rightful owner. The rights are not automatically transferred to the new owner of the domain name, or to companies like Domainsponsor.com.
You wrote: "I doubt they care about archives of their domains' previous history (before they bought them)."
What exactly do you think they bought?
Wouldn't common sense tell you and anyone else that all that was bought was the URL _ _ _.com, and not all the Rights to the Works that were previously used on that URL?

This post was modified by jory2 on 2012-07-25 17:43:56

Poster: Jeremy Leader Date: Jul 25, 2012 1:50pm
Forum: web Subject: Re: Domainsponsor.com erasing prior archived copies of 135,000+ domains

Jory2, I believe I'm in complete agreement with you. My reference to a "delete directive" was in response to CoJaBo.

Can anyone confirm CoJaBo's claim that a directive like:

User-agent: *
Disallow: /

will NOT cause archive deletion (but will cause Internet Archive to stop crawling the site)?

Poster: CoJaBo Date: Aug 7, 2012 4:03pm
Forum: web Subject: Re: Domainsponsor.com erasing prior archived copies of 135,000+^W 24 million+ domains

Disallowing the user agent "*" will not cause removal; it will stop crawling as expected, but only a directive naming the specific user-agent given on the removal FAQ page will actually cause removal of past content.
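
To illustrate the crawling half of this, here is a minimal Python sketch that uses the standard library's urllib.robotparser to show which crawlers each form of the directive blocks. The agent name "SomeOtherBot" and the example.com URL are placeholders, and "ia_archiver" is simply the user-agent name discussed in this thread. The sketch only shows ordinary robots.txt matching; whether past content gets removed depends on the Archive's own policy as described on its FAQ page, which a parser cannot show.

```python
from urllib.robotparser import RobotFileParser

# Wildcard form quoted earlier in the thread: applies to every crawler.
WILDCARD = """User-agent: *
Disallow: /
"""

# Form that names one crawler only (the name used in this thread).
IA_ONLY = """User-agent: ia_archiver
Disallow: /
"""

# "SomeOtherBot" is a made-up crawler name, used only for comparison.
AGENTS = ["ia_archiver", "SomeOtherBot"]


def allowed_agents(robots_txt, agents, url="http://example.com/page.html"):
    """Map each agent name to True if robots_txt lets it fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return {agent: parser.can_fetch(agent, url) for agent in agents}


if __name__ == "__main__":
    print("wildcard:", allowed_agents(WILDCARD, AGENTS))
    print("ia_archiver only:", allowed_agents(IA_ONLY, AGENTS))
```

Under the wildcard form both crawlers are told to stay out; under the second form only ia_archiver is, and other crawlers remain free to fetch the site.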

I do agree that there should be something explicitly stating "Remove" in the directive to prevent such a mistake. Others off-site have pointed out that some "bad bot blacklists" also include these lines without explaining what "ia_archiver" actually is. It's possible DomainSponsor got the lines from a source like that and didn't realize they would cause *removal* of the content from the Archive, since by that stage the lines would have been far separated from the removal FAQ entry.

If anyone's been following the list of sites registered to their nameserver (that is, sites being removed from the Archive in this way), it has increased nearly two-hundred-fold since I made this post; the current count is over 24 *million*.
I'm not sure whether this indicates they are expanding that rapidly or simply that the particular index site is catching up with their existing registrations; I suspect the latter.

Poster: Jeremy Leader Date: Aug 7, 2012 4:37pm
Forum: web Subject: Re: Domainsponsor.com erasing prior archived copies of 135,000+^W 24 million+ domains

OK, CoJaBo, thanks for that clarification.

So there's no way to say "Internet Archive, don't crawl my site, but don't delete the archive", while still allowing other crawlers to crawl the site?

Poster: CoJaBo Date: Aug 7, 2012 4:40pm
Forum: web Subject: Re: Domainsponsor.com erasing prior archived copies of 135,000+^W 24 million+ domains

It doesn't seem so; the FAQ only mentions those lines for removal, and it doesn't seem to give an option for "don't crawl the site anymore, but still keep the existing content".
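
Given that, it may be worth checking a robots.txt for an explicit ia_archiver group before deploying it, for example when the rules were copied from a third-party blacklist. The following is a rough Python sketch of such a check. The function names are made up for illustration, the parsing is deliberately simplified, and the assumption that an explicit "ia_archiver" group is what matters comes only from what has been said in this thread, not from anything the Archive has confirmed here.

```python
def groups_disallowing_everything(robots_txt):
    """Return lowercase user-agent names whose group has 'Disallow: /'."""
    blocked = set()
    current_agents = []   # user-agent names of the group being read
    seen_rule = False     # True once the current group has a rule line
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()   # drop comments
        if not line or ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            if seen_rule:
                # A user-agent line after rules starts a new group.
                current_agents = []
                seen_rule = False
            current_agents.append(value.lower())
        elif field in ("allow", "disallow"):
            seen_rule = True
            if field == "disallow" and value == "/":
                blocked.update(current_agents)
    return blocked


def names_ia_archiver(robots_txt):
    """True if some group explicitly names ia_archiver with 'Disallow: /'."""
    return "ia_archiver" in groups_disallowing_everything(robots_txt)


# Hypothetical blacklist-style file; "BadBot" is an invented name.
BLACKLIST_STYLE = """User-agent: BadBot
User-agent: ia_archiver
Disallow: /

User-agent: *
Disallow: /private/
"""

if __name__ == "__main__":
    print(names_ia_archiver(BLACKLIST_STYLE))
    print(names_ia_archiver("User-agent: *\nDisallow: /\n"))
```

A file that only uses the wildcard group is not flagged, which matches the distinction drawn above between stopping crawling and requesting removal.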

I had hoped someone from either DomainSponsor or the Archive would have responded to my emails by now.